/ 

r 


FAC  TON  3 


u> 


^  x  (j 

-TiMaMIOMUlY 

UNS  .CANv^l  1 


IS^CAN^/ 


nations  project 

0EN  FACTOR  2 


mat,onaiutechhical 
information  SERVI 


a 


DISCLAIMER  NOTICE 


THIS  DOCUMENT  IS  THE  BEST 
QUALITY  AVAILABLE. 

COPY  FURNISHED  CONTAINED 
A  SIGNIFICANT  NUMBER  OF 
PAGES  WHICH  DO  NOT 
REPRODUCE  LEGIBLY. 


Ihc  Rliacnuionalil'y  of  Nations  Project 
Hr  Jver.si  l:y  of  Ihv/nii 


RESEARCH  REPORT  KO.  15 

Investigations  into  Alternative  Techniques  for 
Developing  Empirical  Taxonomies: 

The  Results  of  Two  Plasmodes 

Warren  R.  Phillips 


Oct  ober  1968 


Prepared  in  Connection  with  the  National  Science  Foundation, 
Grant  No.  GS-1230,  and  the  Advanced  Research  Projects  Agency, 
ARPA  Order  No.  1063,  and  Monitored  by  the  Office  of  Naval 
Research,  Contract  //N00014-G7--A-0387-0003  / 


This  document  has  been  approved  for  public  release  r.rd  sale;  its 
distribution  is  unlinited  and  reproduction  in  whole  ev  in  ya’*t  i 
permitted  for  any  purpose  ol  the  United  States  Covcrivont 


TABLE  OV  CON'ILNTS 


ABSTRACT 

1.  INTRODUCTION 

2.  PLASMODE 

2.1  Factor  Groups  ,  , 

2.2  Hierarchical  Clustering  Scheme 

2.3  Conclusions  from  Plasntodcs 

3.  SUBSTANTIVE  EXAMPLE  . 


Page 

3. 

2 

3 

4 

16 

25 

31 


S»\  unK  CljsM(iijiti»jj 


DOCUMENT  CONTROL  DATA  •  R  &  D 

jmifih-  «  of  tuft,  lntjy  %*t  wjMrwrr  *ml  mj/MiHtf  ^woriNM  miiw  /if  iMfffrd  wjirw  >/m*  mnN  i»  ff.«»Mfloifi 


i  report  title 

Investigations  into  Alternative  Techniques  for  Developing  Empirical  Taxonomies: 
The  Results  of  Two  Plasmodes 

4  OEtCRiPTivE  NOTE*  ( Typ a  of  report  and  Inel u.iv*  dataa) 

Research  Report  No.  15 

*.  Au  ThOkiII  (Firat  Mm*!  middle  initial,  laat  nama) 

Warren  R.  Phillips 


10.  DISTRIBUTION  STATEMENT 

This  document  has  been  approved  for  public  release  and  sale;  its  distribution  is 
unlimited  and  reproduction  in  whole  or  in  part  is  permitted  for  any  purpose  of  the 
United  States  Government. 


IS.  ABSTRACT 

This  is  k  research  report  on  the  applicability  of  grouping  techniques  to  the  data  of 
international  relations.  While  it  is  still  too  early  to  make  strong  recommendations  for 
the  "best"  grouping  method,  the  results  of  this  paper  suggest  some  important  character¬ 
istics  of  current  techniques  which  should  be  considered  in  selecting  a  grouping  tech¬ 
nique  . 

Several  goals  guided  this  research:  First,  to  demonstrate  the  characteristics  of 
current  clustering  techniques  by  employing  experiments  with  known  outcomes — plasmodes; 
secondly,  to  suggest  a  measure  of  similarity  that  is  advantageous  in  grouping  experiments 
when  correlations  are  of  less  meaning;  and  thirdly,  to  employ  the  techniques  investigated) 
in  the  plasmode  to  a  substantive  problem  from  international  relations. 

Data  were  collected  on  the  mileage  between  a  number  of  cities  in  the  United  States. 

The  cities  were  arranged  into  two  matrixes:  one  of  22  cities,  which  divided  into 
obvious  groups;  the  other  of  60  cities,  with  no  discernible  groups.  The  distance  matrix 
was  then  rescaled  to  a  similarity  matrix  varying  for  0.0  to  1.0,  where  1.0  is  the  closest. 
The  principal  components  of  this  matrix  were  computed  and  the  similarities  matrix  was 
also  hierarchically  decomposed  employing  an  algorithm  supplied  by  S.  C.  Johnson.  The 
groups  delineated  by  these  techniques  were  compared  geographically  on  maps  of  the  United 
States. 

The  research  suggests  that  direct  factor  analysis  techniques  seem  core  appealing  than 
the  hierarchical  clustering  schemes  for  describing  the  structure  of  the  spaces  defined 
by  the  plasmodes.  Hierarchical  clustering  schemes  are  useful  when  the  researcher  wishes 
to  break  up  a  dense  cluster  of  entities,  however. 
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ABSTRACT: 


This  is  a  research  report  on  the  applicability  of  grouping  techniques 
to  the  data  of  international  relations.  While  it  is  still  too  early  to  make 
strong  recommendations  for  the  "best:"  grouping  method,  the  results  of  this  paper 
suggest  some  important  diaractcristics  of  current  techniques  which  should  be 
considered  in  selecting  a  grouping  technique. 

Several  goals  guided  this  research:  First, to  demonstrate  the 
characteristics  of  current  clustering  techniques  by  employing  experiments  with 
known  outcomes — plasmodes;  secondly,  to  suggest  a  measure  of  slmi  larity.  that  is 
advantageous  in  grouping  experiments  when  correlations  are  of  less  meaning;  and 
thirdly,  to  enplov  the  techniques  investigated  in  the  plasmode  to  a  substantive 
problem  from  international  relations. 

Data  were  collected  on  the.  mileage  between  a  number  of  cities  in  the 
United  States.  The  cities  were  arranged  into  two  matrixes:  one  of  22  cities, 
which  divided  into  obvious  groups;  the  other  of  60  cities,  with  no  discernible 
groups.  The  distance  matrix  was  then  rescaled  to  a  similarity  matrix  varying 
for  0.0  to  1.0,  where  1.0  is  the  closest.  The  principal  components  of  this 
matrix  ware  computed  and  the  similar. tics  matrix  was  also  hierarchically 
decomposed  employing  an  algorithm  supplied  by  S.  C.  Johnson.  The  groups  delin¬ 
eated  by  these  techniques  were  compared  geographically  on  maps  of  the  United 
States . 

The  research  suggests  that  direct  factor  analysis  techniques  seen  more 
appealing  than  the  hierarchical  clustering  schemes  for  describing  the  structure 
of  the  spaces  defined  by  the  pin  snodes.  Hierarchical  clustering  schemes  arc 
useful  when  the  researcher  wishes  to  break  up  a  dense  cluster  of  entities.,  however. 
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1.  INTRODUCTION 

Grouping  nations,  objects,  individuals,  or  cases  by  types  is  a  hr,  >!c 
step  in  describing  phenomenon  and  building  science.  The  virtue  of  typing  is 
that  it  enables  parsimonious  descriptions  of  objects  and  facilitates  reliable 
predictions  about  them  based  upon  their  croup  identity.  Classification  is  the 
process  of  ordering  cases  into  groups  that  best  represent  certain  empirically 
measured  relations  of  contiguity,  similarity,  or  both.  While,  this  process 
is  certainly  not  new  to  the  social  sciences,  it.  has  been  more  readily  asrc-  e  -  -  , 
ciated  with  biological  taxonomies.  In  taxonomic  work  in  biology-~v.,hcre  typal 
distinctions  have  been  built  in  a  roll-systematic  (non-quantitative)  fashion, 
it  is  becoming  increasingly  clear  that  a  more,  systematic  basis  is  required 
for  scientifically  meaningful  classification  (Sokal  and  Sneatb,  1963). 

In  political  science  voices  are  being  raised  for  the  use  of  systematic 
grouping  techniques  (Kussett  1967,  Rommel  1969,  Broms,  1966).  .The  problem  with 
the  prevailing  types  in  political  science  is  that  the  rationale  underlying 
the  categorization  is  often  not  explicit.  It  is  not  clear  whether  our  "types" 
really  divide  different  kinds  of  variance.  If  we  are  to  deal  in  types  a  clear 
and  empirical  basis  for  the  distinctions  must  be  made. 

Several  techniques  for  grouping  entities  have  been  suggested  recently. 

I  have  chosen  two  techniques  which  seem  to  be  most  fruitful  for  use  in  inter¬ 
national  relations.  In  order  to  investigate  the  applicability  of  these 
techniques,  a  special  type  of  "experiment"  has  been  designed.  This  type  of 
experiment  is  one  in  which  the  outcome  is  known  before  the  experimenter  begins 
his  r.T-.ilysis  and  is  termed  a  pi  asm  ode .  Flasmodes  are  used  when  an  analyst 
is  interested  in  discerning  the  characteristics  of  his  analytical  tools  and 
not  the  characteristics  of  the  entities  under  study.  Cat tell  suggests  the  use 
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of  pl;.s!iudc.'-  I  o  ji'.v»  iligato  th<  properties  of  n  specific  Fiothwd  (unite  11, 

3  06V)  .  lio  cor  iMits.: 

•j Itc  useful  nor.  of  a  plnsnodc  resides;  first  in  helping  uo 
evaluate  the  power  of  various  cxpcriiiouial  nml  analytical 
procedures  to  show  ili.  t  i  Ik*  data  in  any  experiment  doer,  net  fit 
o  particular  Model.  For,  before  applying  the  investigatory 
procedures:  to  be  evaluated,  uo  know  already,  by  virtue  of  our 
"back  stage"  information,  what  actual  structures  should  be 
'  revealed.  From  seeing  the  degree  to  which  they  are  revealed,  v:e 

»  are  able  to  evaluate  the  relative  soundness  of  the  given  Method* 

and  inferential  procedures.  In  all  ordinary  investigations,  as 
distinct  front  investigations  with  a  plasmode,  the  scientist  leans  on 
the  belief  that  his  experimental  methods  and  statistical  analyses 
are  capable  of  revealing  whether  the  data  has  a  structure  which 
fits  some  particular  model ,  anc!  on  this  assumption  he  proceeds  to 
"substantive."  discoveries  in  the  given  domain.  In  working  with  a 
plasmodc,  precisely  the  converse  approach  is  made,  since  the 
technique  not  the  substantive  area  is  investigated,  lie  knows 
exactly  what  the  "substantive"  structural  answer  mist  be,  and  he 
is  setting  out  to  discover  things  about  the  method. .. (pp.  304) 

2 .  PLASMODE 

In  order  to  keep  the  structure  of  a  plasmode  in  its  simplest  form, 
a  two  dimensional  problem  was  chosen  for  this  investigation.  When  we  deal 
with  only  two  dimensions,  we  can  plot  entities'  positions  and  then  visually 
search  for  clusters.  For  this  purpose  a  common  road  map  mileage  chart  of 
several  U.  S.  cities  seems  ideal.  The  mileage  chart  becomes  a  matrix  of 
distances  between  all  of  the  cities  identified  in  the  matrix.  The  matrix 
is  square  (cities  across  cities)  and  symmetric  (the  distance  from  Chicago 
to  Detroit  is  the  same  as  from  Detroit  to  Chicago).  The  grouping  of  cities 
ought  to  be  quite  easily  recognizable.  There  may  be  more  than  one  way  to 
defile  groups,  depending  on  the  size  of  the  groups  we  wish  to  identify  and 

a 

on  tie  general. criteria  we  happen  to  apply.  Any  of  these  choices  can  be 
teste  ,  however,  against  a  visual  inspection  of  the  groups  on  a  map  to 
ascertain  their  logical,  "meaning," 


Tvo  pHmndtfc  vuc  u  ployed, 
to  group  into  rntlicr  dit'.liirt  fitn. 


Ilu  f  J r»* t  contains  c < t- tc ,  cheucr. 
The  r.econd  i  ctrlr.  rouiiiinn  60  cJtift , 


which  arc  feprnad  out  ncrof.fi  the  non. 


11, if.  second  nntrix  nllown  uc  to  ruci-r- 


tnin  the  tendency  of  the  tccl.niMMM  to  describe  group*  v.-hlch  do  not  atm  vo 
be  intuitively  meaningful.  Tallin  I  contninf.  the  Hot  of  ci'  Ur,  for  K  »h 


plnr-modcs. 

2.1  Fnctor  Croups 

We  will  present  too  methods  of  croup  ini;  n.rioiu.  on  their  diet  nice:.. 
The  first  net  hod  is  a  direct  factor  analysis  of  the  distance  matrix  (Hummel, 
1969,  Section  22.2).  The  (mileage)  oistnnccs  arc  first  reeled  to  lie 
between  0+1. CO,  where  1.00  is  the  smallest  distance,  This  transferns  the 
distance  matrix  to  a  similf.rity  matrix,  which  is  then  factor  analyzed 
(principal  components)  *3  though  it  were  a  correlation  •.-.ntrix.  The  result¬ 
ing,  factors  define  cities  whose  pattern  of  distances  from  other  cities  are 
interdependent — similar  in  profile.  Cities  with  high  1  cadi j.gs  or.  the  same 
factor  are  similarly  located  in  space. 

The  factors  can  be  rotated  to  orthogonal  end  oblique  simple  structure 
to  ascertain  the  clearest  definition  of  groups.  Using  factor  analysis  in 
this  fashion  brings  out  distinct  groups:  there  would  be  no  ambiguity  as  to 
the  number  of  types  or  the  membership  of  each  type, 2 


In  employing  factor  analysis,  the  principle  n::>. n  technique  was 
applied  to  the  similarities  (transformed  mileage)  matrix,  notation  to  ortho¬ 
gonal  solution  was  performed  on  three  factors  in  both  experiments.  The  thoi.ee 
of  this  cutoff  will  be  discussed  later.  Tables  IJ.  and  III  present  the  ortho¬ 
gonal  .. otations  of  the  22  city  and  60  city  plasmodcs. 


In  the  22  city  example,  the  factors  define  three  clusters, 
cluster  is  centered  around  Albany,  Kcu  York,  and  defines  a  group  of 
mid-western  cities,  The  second  factor,  centered  around  Colorado  Spr 


The  first 
eastern  a 
irga  and 


nd 


Tabic  1.  List  of  Cities  in  Plasmode  1  and  2 


Plasmode  I 

3  Albany,  N,  Y. 

2  Ashvillc,  Tenn, 

3  Atlanta,  Ga. 

4  Atlantic  City,  N.  J 

5  Baltimore,  Md. 

6  P>  nghampton,  N.Y. 

7  Birmingham,  Ala. 

8  Boston,  Macs. 

9  Buffalo,  N.Y. 

10  Charlotte,  N.C. 

31  Chatanooga,  Tenn. 

12  Chicago,  Ill. 

13  Cincinnati,  Ohio 

14  Cleveland,  Ohio 

15  Colorado  Bps., Colo. 
10  Columbus,  Ohio 

17  Dallas,  Texas 
38  Davenport,  Iowa 

19  P.-.av.  r,  Colo, 

20  Detroit,  Mich. 

21  K1  Faso,  Tex. 


Pin  erode,  II 

1  Albuquerque,  N.  M. 

2  Atlanta,  Ga. 

3  Baltimore,  Md. 

4  Birmingham,  Ala. 

5  Bismarck,  N.D. 

6  Boston,  Mass. 

7  Buffalo,  N.Y. 

8  Cheyenne,  Wyo. 

9  Chicago,  Ill. 

10  Cincinnati,  Ohio 

11  Cleveland,  Ohio 

12  Columbus,  Ohio 

13  Dallas,  Tex. 

14  Denver,  Colo. 

15  Des  Moines,  Iowa 

16  Detroit,  Ilich. 

17  Dubuque,  Iowa 

18  Evansville,  Ind. 

19  Ft.  Wayne,  Ind. 

20  Grand  Canyon,  N.M. 

21  Glacier  Nat,  Park, Mont 

22  Helena,  Mont. 

23  Houston,  Tex, 

24  Indinnpolis,  Ind, 

25  Jack. -ion t  Miss, 

26  Jacksonville,  Fla. 

27  Jefferson  City,  Mo. 

28  Kansas  City,  Mo. 

29  Tansing,  Mich. 

30  Little  Rock,  Ark, 


Plnstrtodc  II(cont.) 

31  Los  Angeles,  Calif. 

32  Louisville,  Ky. 

33  Memphis,  Tenn. 

34  Milwaukee,  Wise. 

35  Minneapolis ,St .Paul , Minn . 

36  Nashville,  Tenn. 

37  New  Orleans,  La. 

38  New  York,  N.  Y. 

39  Omaha,  Neb. 

40  Peoria,  Ill. 

41  Philadelphia,  Pa. 

42  Phoenix,  Ariz. 

43  Pierre,  S.D. 

44  Pittsburgh,  Pa. 

45  Portland,  Ore, 

46  Raleigh,  S.C. 

47  St.  Louis,  Mo. 

48  Salt  Lake,  Utah 

49  San  Francisco,  Cal. 

50  Seattle,  Wash. 

51  Spokane,  Wash. 

52  Springfield,  Ill. 

53  Springfield,  Mo, 

54  Tampa,  Fla. 

55  Toledo,  Ohio 

56  Topeka,  Kansas 

57  Tulsa,  Okla. 

58  Washington,  D.C. 

59  Wichita,  Kan. 

60  Yellowstone,  Wyo. 
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Table 

2. 

Orthogonal  Rotation 

(22  cities) 

Factor 

City 

1 

2 

3 

Albany 

1 

9203 

0284 

2150 

Asheville 

2 

4976 

1939 

7964 

Atlanta ,Ga. 

3 

3830 

2437 

6672 

Atlantic  City 

4 

8420 

0240 

3736 

Baltimore 

5 

8250 

0927 

4366 

Bingham 

6 

9137 

0772 

2673 

Birmingham 

7 

3501 

3416 

8240 

Boston 

8 

8750 

-0600 

2307 

Buffalo 

9 

9059 

2006 

2502 

Charlotte  •  - ' 

10 

5321 

1293 

7733 

Chatanooga 

11 

4410 

3034 

6095 

Chicago 

12 

6719 

4831 

3636 

Cincinnati 

13 

6808 

3631 

5396 

Cleveland 

14 

8346 

2916 

3659 

Colorado  Sp. 

15 

1593 

9253 

1181 

Columbus 

16 

7514 

3411 

4701 

Dallas 

17 

1317 

7084 

5252 

Davenport 

18 

5953 

5701 

3525 

Denver 

19 

2012 

9304 

0743 

Detroit 

20 

7968 

3353 

3571 

El  Paso 

21 

-0745 

8274 

2609 

Erie 

22 

8769 

2409 

3045 
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Table  3. 


Orthogonal  Rotation  (60  Cities) 


Vrti 

.-Me  Cot, 

•suiiali:  y 

Factor 

No. 

!!nr-o  3 

Factors 

1 

2 

JL 

1 

Al’.iiuiiK  fpjUe 

824 

2C9 

645. 

-604 

2 

/ limits 

870 

684 

087 

-629 

3 

Pal tinore 

671 

903 

066 

-227 

4 

tiiningh.  !u 

694 

646 

134 

-678 

5 

Li  snare!; 

601 

505 

720 

-167 

6 

to;.  Ion 

767 

871 

-001 

-095 

7 

Koffalo 

860 

907 

156 

-179 

8 

Clicycuno 

841 

375 

750 

-370 

9 

Chicago 

912 

614 

366 

.340 

10 

Cincinnati 

921 

839 

234 

-402 

11 

Cl<  vel.tml 

924 

904 

216 

-245 

12 

Colvnbus 

922 

870 

214 

-345 

13 

Dallas 

8C9 

379 

387 

-771 

14 

Denver 

835 

345 

722 

-442 

15 

Dos  I’oino.r 

C63 

CM 

511 

-559 

16 

Detroit 

911 

866 

256 

-245 

17 

iHibutju? 

876 

747 

446 

-344 

18 

r.vr.nsvillc 

910 

746 

270 

-530 

19 

Ft.Wnyne 

927 

855 

269 

-336 

20 

Grand  Canyon 

805 

053 

733 

-515 

21 

Glacier  Nat. 

847 

250 

686 

-006 

22 

Helena 

870 

249 

895 

-063 

23 

Hair,  ton 

841 

355 

294 

-793 

24 

Indiana 

928 

820 

285 

-417 

25 

Jackson 

900 

535 

168 

-760 

26 

Jacksonville 

779 

624 

-023 

-624 

27 

Jefferson  City 

891 

643 

407 

-599 

26 

Kansan  City 

C84 

579 

484 

-561 

29 

Lnnsinr, 

899 

664 

286 

-266 

30 

Little  Rock 

908 

538 

300 

-727 

31 

1.03  Anycles 

725 

-07/ 

721 

-446 

32 

Louisville 

9 14 

795 

232 

-479 

33 

Mer'i'ols 

915 

603 

255 

-692 

34 

Hilvankeo 

881 

801 

391 

-295 

35 

Hinn.-Sc.  Pan  1 

£24 

677 

544 

-265 

36 

} achvi lie 

903 

714 

193 

-597 

37 

Kcr  Orleans 

649 

405 

151 

-769 

38 

tlcw  York 

821 

890 

024 

-169 

39 

Of  ’ilia 

850 

595 

566 

-429 

40 

Peoria 

899 

753 

387 

-428 

41 

Mil  lmlelphis 

050 

900 

046 

-191 

42 

Plmtnix 

769 

035 

660 

-566 

43 

Pierre 

818 

507 

6S9 

-269 

44 

Pittr-hiir^’i 

913 

909 

161 

-247 

45 

Port  land 

799 

025 

653 

-008 

7 


Variable  Ccfc_.unulity 

Factor 

No. 

Knafi  3 

Far tors 

? 

_  3 

46 

fcnlcijfh 

613 

605 

015 

-406 

47 

£t. Louis 

909 

702 

354 

-540 

48 

Srl»  )~nl:c  City 

C31 

186 

853 

-296 

49 

Srn  Francisco 

700 

-C7F. 

793 

-239 

50 

Seattle 

C06 

066 

693 

056 

51 

S|»p1:pup 

656 

164 

910 

019 

52 

Cpriiigf  iclJ.l. 

90? 

731 

373 

-478 

53 

Springfield,!*. 

C9C 

5M 

400 

•633 

54 

Tar  pa 

737 

563 

-054 

-645 

55 

Toledo 

930 

69C 

261 

-263 

56 

TopcV.n 

674 

549 

510 

-559 

57 

Trlsn 

864 

464 

436 

-677 

58 

Wat.hlncion 

865 

C92 

066 

-253 

59 

l.'icliita 

873 

474 

5*7 

-617 

60 

Yellowstone 

685 

275 

879 

-190 
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Denver,  i  .1  group  of  liochy  Dorntuin  and  South  Western  cities,  while  the 
thin*  factor  it-  r.  routle'-rn  dJemsiou  rcitern’  aronnJ  At  la  tit..,  Georgia. 

In  the  CO  city  enu  ,  t he  cutoff  ctitcrixi  for  rotation  was  three 
factors.  Tl*e  (cut  tli  factor  c::plri?ted  only  2.7  percent  of  tin  variance  with 
no  )o.  dings  chow  .5.  The  first  e.lumei ,  by  fr.r  the  strongest,  centered 
Around  Pittsburgh,  Cleveland  cm!  lialt i core.  Tlif t*  factor  is  centered  in  the 
Dost  r:id  groups  citjes  both  l.Vst  end  South.  The  second  factor  is  centered  in 
the  terthvcit  at  Spokane  and  covers  the  Dent.  The  third  factor  loosely 
centers  around  Hew  Orleans  and  Hour  ton.  This  factor  hoc  no  high  loadings  oo 
it.  W  c  of  the  cities  r. re  included  within  the  other  two  factors  in  the 
southern  states  both  to  the  cart  etui  west. 

The  inf ovum  tun  presented  in  the  factor  tables  can  also  be  conveyed 
using  a  nap  of  the  United  States.  The  observer  err.  move  easily  check  the 
structuring  of  the  platnooc  experiments  with  hlr.  intuitive  "feel"  for  the 
grouping  of  cities  based  upon  mileage  distances. 

The  groupings  of  cities  are  readily  discernible  in  the  nap  presenta¬ 
tion.  Ihe  contours  or  concentric  circles  on  the  rap  refet  to  factor  loading 
criteria.  Thus,  the  inner,  thin  circle  clusters  ell  cities  with  factor 
loadings  above  .16  on  the  r.aae  factor.  Thu.  other  circles  refer  to  loading 
criteria  of  .71,  .50,  nrd  .30  respectively.  Turning  to  the  groupings,  a 
mid-*/,.  ..torn  group  wigl.t  have  been  expected*  These  mid-western  cities  were 
inclu-  ed  in  the  other  group?,  but  they  did  not  group  into  a  reparate  cluster. 

In  belli  plnr.nodcs  those  cities  would  have  groupeo*  if  a  fourth  factor  hr.d  ’.Ten 
employed.  The  percentage  of  variance  accounted  fo»  in  a  fourth  faetov  in  each 
cast  ;.wS  very  lo-*.  The  loadings  were  below  .500  nr.d  the  fattois  would  not  hove 
been  “stated  under  nort.r.1  criteria  of  the  strength  of  loadings  or  a  SKKKC  tost 
(Cnttell,  J 900)  •  Tiiey  we re  not  employed  in  tha  plansoclo  for  the  sai.e  reasons. 
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Both  factor  structures  wore  rotated  tc  cn  oblique,  bicjwnrtiiiin 
colu t2 3  4 on.  CMiquc  rotation  *..*ac  dtilvcd  to  ascertain  the  relationships 
between  the  p.roup  factors  in  this  analysis.  If  the  fetters  nv<  hlj.1i  1  v 
irtt  rcorrelnted,  each  p.roup  is  over  hipping  the  ether  croups  red  the  factors 
nre  not  equally  dtfinin^  u  cluster  of  tty. .tic.  ll.e  prii  ary  pattern  Matrix  has 
teen  interpreted  in  the  22  city  peril. cuts.  ire  sharp  f.’ctcrr  appeared  with 
n  third  fnctor  cherinp.  so«e  v..oecr/itc  loadings.  The  three  fixture  nre  quite 
siniJcr  to  the  orthogonal  solution  except  thnt  the  first.  factor  lire  si::  cities 
loadfnp  above  .86, while  in  the  orthogonal  cnee  there  were  only  five.  Albany 
and  IHn^h.anton  ore  ot  the  center  of  the  first  cluster.  The  second  fnctor 
centers  around  Denver  and  Colorado  Spt injs, while  the  third  factor  re  n 
southern  factor  loosely  centered  at  Atlanta. 

The  factor  con  elation  Matrix  cho  us  that  factors  1  air'  3  have  n 
moderate  correlation.  Factor  two  in  ilip.htly  lens  related  to  either  factor  1  ci 

123 


2  .2679  _ 

3  .  A322  .3210  _ 

3.  Thv?  the  groups  in  the  Kant  and  the  South  aces.'  cere  associated  with  cjc’i 
other  than  either  is  with  the  western  proup. 

In  contrast  to  the  wild  chr.npcs  assoc  fated  with  the  rIr<M  ir.  total  L-r 
on  th-  22  city  Matrix,  oblique  factor  analysis  ef  tin  60  exty  r.atvi:.  prod  seed 
r.tri'  ]»«•  changes  fro*i  the  ortho. ,annl  analysis.  In  this  placto..-.  there  i  only 
one  veil  defined  cluster.  All  five  o[  the  cities  with  lorvinp*  above*  .66 — 
Filenai  i,  Glacier,  Helena,  r_'.er**o,  and  Yol Jews; tour— a  c  equally  central  to 
the  c*i.r.tcr  defined  by  the*  factor.  Only  cifht  of  the  (0  cili.a  lave  e.  lor  in^ 
of  le*  i  then  .50  on  this  factor.  It  vunld  appear  that  the  on:  factor  accuunlr. 


for  r  ».at  of  the  rp.'cinl  variation  in  cUi«  . 


Oblique  Potation  (22  Cities) 


Table  4. 


Citv 

1 

_ 3 

Albany 

1 

9£07 

-0986 

-0342 

Asheville 

2 

403b 

0386 

6973 

At  Jr.nLn 

3 

2619 

0932 

7997 

Atlantic  City 

4 

CoS  3 

-1173 

1624 

Baltimore 

5 

8343 

-0521 

2 252 

Binfthnnton 

6 

9607 

-0539 

0173 

Kirwltighsn 

7 

2236 

2060 

7469 

Boston 

6 

9352 

-1091 

0098 

buffalo 

9 

9440 

0794 

-0184 

Charlotte 

10 

4  60S 

-0299 

6722 

CllAt  AilOOfcA 

11 

3290 

1581 

7109 

Chicago 

12 

6407 

3830 

1508 

Cincinnati 

13 

6347 

2336 

3375 

Cleveland 

14 

8394 

1665 

1144 

Colorado  Sp. 

lb 

0802 

9410 

-0608 

Columbus 

16 

7261 

2127 

2446 

Dal  In:; 

17 

0009 

6578 

4271 

Davcrporl 

18 

5539 

4872 

1253 

Denver 

19 

1340 

9481 

-1216 

Detroit 

20 

7954 

2178 

1089 

El  Paso 

21 

-1934 

8426 

1778 

Eric 

22 

9004 

1172 

0430 

13 


Te,bl< 


Albuquciquo 

1 

At]  min 

2 

Baltimore 

3 

Birmingham 

4 

lilstir.rh 

5 

Boston 

6 

Buffalo 

7 

Cheyerne 

8 

Chicago 

9 

Cincinnati 

10 

Cleveland 

11 

Columbus 

12 

Dallas 

13 

Denver 

14 

Dcs  I'oincs 

15 

Detroit 

16 

Dubuque 

17 

Evansville 

18 

Ft.  Wayr.fl 

19 

Grand  Canyon 

20 

Glacier  Hat. 

21 

Helena 

22 

Houston 

23 

Indiana 

24 

Jtekson 

25 

Jacksonville 

26 

Jefferson  City 

27 

Kansas  City 

26 

Lansing 

29 

Little  Kock 

30 

Los  Angeles 

31 

Louisville 

32 

Memphis 

33 

Milwaukee 

34 

Minn. -St.  Paul 

35 

Kasliville 

36 

Hew  Orleans 

37 

Hew  York 

3C 

Omaha 

39 

Peoria 

40 

Philadelphia 

41 

Phoenix 

42 

Pierre 

43 

Pittsburgh 

44 

Portland 

45 

Raleigh 

46 

St.  l.ouir 

47 

Salt  Lake  City 

46 

.  Oblique  notation 


JL. 

6470 

4655 

5754 

4823 

8794 

5020 

6509 

8291 
76f’6 
6759 
6964 
6773 
5350 
7865 
7964 
7206 
7959 
6512 
7293 
6286 
8676 
8745 
4449 
7064 
4625 
3414 
7033 
7292 
7323 
5557 
5439 
6476 
5597 
78J  5 
6350 
5700 
4032 
5334 
6056 
7507 
5579 
5650 
8636 
6559 
7438 
47(6 
6943 
8040 


(60  Cities) 


Factor 

2 

3 

5698 

2993 

035? 

3358 

-3337. 

-0902 

1612 

37C5 

1.206 

-2097 

-*363 

-1311 

-3339 

-1607 

3516 

0059 

-0603 

-0464 

-1044 

0403 

-2605 

-1170 

-1710 

-0151 

4845 

4729 

4060 

0664 

1169 

0255 

-2338 

-1231 

-0061 

-0454 

0542 

1746 

-1381 

-0404 

6328 

2410 

2214 

-3255 

2603 

-2576 

4756 

5264 

-0626 

0447 

3055 

4719 

0717 

3612 

1902 

1934 

2600 

1930 

-194? 

-1060 

3257 

4086 

6545 

2221 

-0257 

1242 

2427 

3685 

-094) 

-0502 

01  f  4 

-1236 

0063 

2669 

3249 

5055 

-3848 

-1260 

1914 

0449 

0261 

0464 

-3657 

-1162 

6543 

31)9 

1830 

-1119 

-2845 

-10)2 

.>563 

-2559 

-1713 

1174 

1207 

1728 

450f 

-0325 

13a 


1 actor 


City 

.1 

JL 

J. 

Snn  Francisco 

AO 

6065 

5397 

0120 

Seattle 

50 

7678 

2870 

-3271 

Spokane 

Springfield,!. 

51 

8377 

2639 

-3280 

52 

726A 

068A 

1026 

Springfield,!;. 

53 

66  2 A 

2755 

2811 

Tampa 

5  A 

2809 

1090 

A  2  83 

Toledo 

55 

7271 

-2209 

-1090 

Topekn 

56 

7333 

2866 

1930 

Tulsa 

57 

6359 

3778 

3A12 

l/sshington 

58 

5695 

-3090 

-0631 

Wichita 

59 

6953 

37A1 

2670 

Yellowstone 

60 

8769 

33A7 

-1638 

In  addition  to  the  major  factor,  two  minor  factors  appear.  The 
first  is  centered  around  the:  Grand  Canyon  and  the  second  around  'Jew  Orleans 
and  Houston.  These  two  minor  factors  are  bipolar  in  that:  the  loadings  define 
a  group  of  cities  that  are  positively  loaded  on  each  factor  and  another 
group  that  are  negatively  loaded  on  the  factors.  The  implication  of  bipolar 
factors  will,  become  clearer  when  we  turn  to  the  geographic  displays. 

In  interpreting  the  oblique  rotation  in  the  60  city  plasmode,  the 
reference  structure  matrix  was  employed  rather  than  the  more  customary  primary 
pattern  matrix.  Two  criteria  were  influential  on  this  decision.  The 
primary  pattern  matrix  had  29  loadings  above  .86  and  all  but.  three  loadings 
above  .50.  The  interpretation  of  grouping  procedures  would  have  beer  very 
difficult  with  the  primary  pattern.  The  factor  correlation  matrices  also 
provided  information  for  choosing  between  the  two  rotations. 

The  primary  pattern  correlation  matrix: 

Factor  123 

1  _ 

2  -.44  _ 

3  .58  -.77  _ 

The  reference  structure  correlation  matrix: 

Factor 

1  _ 

2  -.00  _ 

3  -.41  .70  _ 

By  discussing  the  reference  structure  matrix  we  arc  able  to  reduce 
the  correlation  between  factors  1  and  2  and  in  general  reduce  all  correlations 
between  factors.  Because  of  the  very  high  loadings  for  the  primary  factors 
and  the  high  correlations  between  the  primary  factors,  the  reference  structure 
matrix  was  selected  as  giving  a  clearer  simple  structure. 


*1  »•«*•*  S3;ms  dvw  3svs  30000 
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Vi  10  j'lfM-rlplJon  o'  t!ic  two  demonstrates  the 

froupinp,  of  eh),  in  Loth  pj  roder..  In  the  play  ode  with  22  cities  thar*- 
in  I  It  tie  eh; upr  in  til*  nmnyik  location  of  ^loups  f  rev  the  orthogonal 
care.  The  r.o;  t  change  ip  liar  shrinkage  in  riz?  of  the  two  stwillcr 

I*  roups;  both  the  Southern  and  l.Yttvru  clusters  cor.r  nore  sharply  in  focus. 

In  the  60  city  plntt’Otle  rore  rtrikiiv,  chmip.cs  occur,  cr  we  should 
expect,  c.Ivmi  the  rfiff  rencot  vhlch  Appeared  ii  the  interpretation  of  the 
factor  wntricer..  kov,  there  ii;  one  leipe  f.roupinp,  centered  in  tlte  Northwest; 
cities  p.roup  fret;  the  host  and  South  os  the  load  inf,  criteria  is  relaxed  frr 
.60  to  .71  and  filially  to  .SO.  The  two  weak  factors  are  centered  in  the 
Southwest  ami  the  South.  In  grouping  cilia.*  according  to  lo.dji'fc,  two  tstr.ll 
r.ruupp  appear  which  have  opposite  leadings  011  each  of  the  nair.ov  factors.  One 
of  these  p.rcupr.— designated  by  dashed  lines  ir.  in  the  Hast  and  represents 
the  bipolar  loadings  0:1  the  routhi’erterii  fcctor.  The  othet  t/mll  group  it  in 
the  fa**  h'ortl-.Wi nt  and  is  the  reverse  of  the  Southern  cluster. 

While  all  the  groups.  are  intr rpve.table,  oblique  solutions  do  not  se^n 
to  be  quite  nr.  ir.cr.ningful  when  there  arc  no  cli’ar  cut  clusters  of  cities  cs 
in  the  f>0  city  plat.nodc.  The  analysis  tends  to  produce  froi.pc  idiich  ore  It  a 
lntgc  to  interpret  easily.  This  is  due  to  the  fact  that  a  lorfe  group  of 
c* ties  tend  to  have  the  sore  pattern  of  distances  across  all  cities  and  there¬ 
fore  c  ucter  tep.ethcr  even  though  they  r.rr  quita  distant  froci  each  other. 

2.2  i'.ltrarchiea)  Clusterin?.  Sel  ene 

Another  method  for  grouping.  entitles  ir.  based  upon  the  distance 
between  all  points  in  space  label,  one  et  a  tir,e  rather  than  the  patterns 
of  variation  in  distances  across  all  entities  associated  with  factor  coaly  ic. 
Tact  v  analysis  looks  at  a  nntrix  o'  cities  r.:;  0  scries  of  vectors. 
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Thun,  fo ctor  analysis  begins.  with  .1  square  cyrxctrie  c,r.tri\— 
cities  by  cities,  in  this  core — of  sicllaiities.  The  remit i:.,",  factors 
dcfiur  cities  whose-  pattern  of  distances  fien  other  cities  ere  interdependent — 
einilnr  in  profile.  Cities  with  high  loadings  o.t  the  r.r.i ::  fret  or  display 
sirilrr  patterns  of  distances  ilcvn  thejr  respective  colt  n  vectors  end  are 
thus  sitailaily  located  in  space.  In  to 'trait,  this  nc  :  a-tltod  builds  a 
hierarchical  groupin'*  of  cities  and  produce r  a  tr::ono:.iic  tree  ot  dendrp;*r.n. 
with  cities  that  ore  closest  in  distance  on  the  bottom  of  each  hvanch.  This 
tecl  lique  is  closely  associated  with  biological  sciences;  the  Methodology  is 
the  some  os  that  employed  in  building  taxonomies  of  reptiles,  nnn-talr.  or 
Incectc  (Soho!  and  Sncnth,  1963). 

Several  techniques  by  which  o  d  e  nd  rep  :  .71  can  be  Luili  have  been 
published.  The  root  recent  of  then  is  by  S.  C.  Johnson  (1907"‘.  It  subsoucr 
Ward's  (1963)  grouping  technique  and  that  employed  hy  £ot:al  red  Snuath  (19(3). 
The  technique  operates  on  either  n  dictcr.cc  cr  similarities  usirix.  It  groups 
objects  on  the  basic  of  their  distances  frets  each  other.  Thus,  the  closest 
two  paints  in  space  will  be  grouped  together  first.  The  next  closest  entities 
are  •  rouped  secondly,  and  so  forth,  until  oil  entities  ere  in  a  single  group. 
The  hey  to  the.  method  is  replacing  two  (or  sore)  objects  (citii  3)  with  a 
single  entity  (cluster)  that  dcf  Jnes  the  distance  between  the.  newly  fornr  ’ 
clui-tc  and  all  other  cities  01  clusters.  When  two  cities  foe  a  group, 
then  arc  two  alternative  distf.nccs  between  the  group  cu.d  the  ether  entities: 
the  original  distances,  between  both  of  the  cities  that  joined  to  form  ti  • 
grou;  and  the  other  entities  net  in  the  group.  Johuren'r  program  allows  the 
choice,  of  either  the  ca.xirn*  1  distance  (the  greeter  of  t5  .  two  -ti stances) 
or  two  winiir.un  distance,  lie  terms  these  alternative  choices  the  P|e...etcr  and 
Connectedness  methods  respectively.  The  resulting  tr  'c  *ie<  r, tv  invariant 
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under  r.enotorc  tranr.fom.i.ieii*;  of  rho  iir.ilariiy  data.  Thus,  vc  need  have 
only  ran!:  order  confidence  in  e«r  data. 

The  die."  t«  r  noth**!  otti  ipls  to  uinltcixe  the  diameter  of  the  croups. 
Thus  the  dfn-.ntcr  t.ithoJ  builds  groups  by  adding  n  city  *0  a  croup  if  the 
rmxJnua  distance  be  liven  the  city  and  group  petkhers  ir»  smaller  than  that 
between  the  croup  end  other  citice  rot  in  the  croup.  Every  tine  a  group  ic 
foincd  between  two  cities  x  and  y  there  is  a  choice  as  to  which  dirtrnre 
we  should  nee  to  represent  the  distance  frets  the  c^oup  /x,  yf  and  other 
cities  such  an  Z.  lhc  d inset et  method  nahus  the  choice  of  the  mnxlmna  distance 

<K*V.,y)  ,*)  -  ».n\  -jj (:: , r) ,  d(y, ?.))  . 

where  d  *•  distance 
(x,y*>  •*  a  cluster  of  cities  x,v 
z  •*  another  city 

•  •  • 

The  connectcJnecc  licthod.  however,  adds  a  city  to  a  group  the 
winln.n  distance  between  the  city  and  any  r.cnber  cf  the  group  is  imallcr  thnn 
that  between  the  group  and  other  cities  not  in  the  croup.  In  its  choice  of 
distn  res  between  the  group  and  other  cities  the  criteria  employed  ic  the 
niniwui  distance 


d(/>:,y/.r)  t:  oir.  /d(x,7.)  ,d(y,;0/ 
where,  d  •»  distance 

/x,y/  *•  a  cluster  of  cities  x,y 
2  *•  another  city 

The  c-v.r.ec tednesr.  nethoJ  tend*',  to  build  chains  of  entities  that  arc  f ausrgc- 
chapcd.  wi lh  the  length  of  the  sausage  chain  nlninized. 
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Usinj*  the  sar.c  pin anodes  analysed  \* i rh  factor  mssly^is  1  h.  vc  employed 
Johnson's  Hierarchical  Clustering  Scheme  (i!CS)  on  the  original  distance 
esot  rices.  I  will  display  the  results  in  two  fon.it.  First  the  deudrogr.  r 
will  be  described  end  then  the  geometric  interpretations  of  tht  dendrograms 
will  he  presented  on  mp r.  of  tht  United  Sinter.. 

The  foru  chosen  to  prison?  the  rieudroRiT??  needs  clcrif leal  ion.  The- 
prograc;  print-out  includes  a  dendrogram  which  lies  n  lcvtl  for  r cch  different 
distance  (similar! ty)  in  the  ttairix.  These  levels  or  branches  in  the  dendrogram 
are  too  numerous  for  interpretution  and  ro: .par iron  wiili  previous  results.  Ir 
order  to  moke  the  hierarchical  clustering  onnlyrcs  comparable  with,  tho  fa  cl  rv 
analysis  results,  1  calculated  the  menu  (x)  nunVr  of  cities  that  gveupfd  at 
each  of  the  four  levels  of  clustering  in  the  factor  results  (,CC>,  .71,  .50,  rr.d 
.30).  The  ncan  size  of  each  of  the  factor  groups  t;as  used  as  •  criteria  fer 

d  ^ 

defining  groups  in  hierarchical  clustering.  Tor  instance,  tliVYecu  fixe  of 
the  tightest  clur.tt'r  groups  (,C6)  in  the  factor  analysis  ef  the  *12  city  plar.eJe- 
was  three  cities.  The  first  grouping  of  at  least  three  cities  in  the  dendrograms 
was  chosen  an  the  first  cluster.  These  levels  art  identified  as  W  levels  fr.  the 
figures.  At  each  K. level  additional  groups  that  have  formed  since  the  las: 
level  arc  identified  regardless  of  sjr.c. 

The  dendrogram  for  both  I'CS  toehui'juor  on  the  22  city  plasnode  are 
reported  in  figures  V  and  VI. 

The  diancter  Method  lie*  grouped  all  cities  haforv  the  J.-.st  level  (22:”). 

The  rev  meet  educr.s  Method  docs  not  pull  in  FI  iMso  anti  Dallas  until  the  If!  level. 
Figures  VII  and  VIII  present  the  geometric  displays. 

The  dlnr.atet  method  groups  cities  into  4  r  ntringCul  gri  ipr.;  the  1 
Hidv.st,  South,  and  l.’cst .  The  eoniiectouiw  ;t  i»ethoJ  do.-o  not  :  ’  .  r-  clear  a 

grouping  of  cities,  however.  The  tad j or  grouping  in  in  the  1  t  Midwest  or:,  r.r 
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bill  the  subgroupings  inside  this;  area  art  not  clear.  For  instance,  Cincinnati 
and  14c*;; t cm  nr«  in  th  r:  a  group,  but  Atlantic  City  and  Baltimore  do  not  joi 
this  group  until  1  iter.  It  would  appeal*  that  in  this  case  the  diameter  method 
Croup.*;  cities  into  neater  clusters  and  tends  to  account  £or  all  the  cities  at 
lower  K  levels  than  does  the  connectedness  method. 


the  60  city  matrix  of  distances  was  analyzed  in  the  same  manner. 

Figures  1X-X  present  the  dendrograms  and  figures  XI-X1I  display  the  geometric, 
interpretations  of  the  60  city  plasmode. 

In  the  connectedness  method,  16  cities  ere  grouped  only  at  the  CON  level 
in  the  dendrograms,  while  all  cities  are  accounted  for  in  the  diameter 
method  prior  to  the  60N  levels,  hooking  at  figures  XI  and  XII — the  results  of 
groupings  presented  in  geometric  representations — the  connectedness  method  does 
not  *  roup  several  of  the  cities.  That  is,  several  cities  do  not  belong  to  any 
of  the  groups  delineated  by  the  diameter  method.  It  does  group  the  large  bloc 
of  mi  ('.western  cities  which  did  not  come  together  in  the  factor  analyses  of  this 
matrix,  however.  The  diameter  method  (Figure  XII)  groups  all  the  cities,  but  it 
results  in  some  which  are  difficult  to  interpret.  For  instance,  the  divisions 
on  the  Kansas  -  Missouri,  border  are  difficult  to  substantiate.  The  breakdown 
of  grout .  in  the  midwest  is  quite  misleading. 

2 . 3  Conclusions  from  Plasmodcs 

\  e  have  presented  two  different  plasmodcs.  The  first— 22  cities — 
plasmcde  cental.  *d  clearly  recognizable  groups.  The  second  plaumode  on  60  cities 
did  r.nt  contain  a  simple  grouping.  There  was  a  high  density  area  in  the.  midwest 
and  an  almost  equal  dispersion  of  cities  on  the  map.  Several  suggestions  can  be 
made  on  tiio  advisability  of  using  each  of  the  four  types  of  analysis  on  a  matrix. 

it  would  appear  that:  direct  factor  analysis.  of  a  rescaled  distance  matrix 
is  r.or-  applicable  than  the  HCf.  techniques  in  both  the  simple  structure  and  the 


complexly  structured  case. 


Ulien  simple  structure  exists  in  the  date,  oblique 
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dimensions  arc  r  better  referent  system  than  the  orthogonal  dl**cn  for'.  Kote 
the  shrinkage  of  r lusters  fren  o:thogo:ial  to  oblique  in  the  22  city  plarrode. 
When  the  date  points  do  not  clearly  cluster  Jn  space,  otthogonal  rotation 
appears  tore  useful  in  definin'4,  groups  thru  oblique  factors.  Oblique  rotation 
is  not  cnploycd  by  all  rnolyr.tr  who  use  factor  analysis.  lor  cxarplc,  Turt 
(19/*0,  p.  266)  has  argued  fer  orthogonal  rotation  rather  than  oblique.  !*ron 
our  2  plasnodcs,  it  vould  sect:  that  the  trgirr.ent  is  veil  founded  vhen  simple 
structure  does  not  exist  in  the  deta.^ 

In  the.  1ICS  analyses,  it  vould  appear  that  the  die*  <-tcr  method  1l  better 
when  clusters  nre  clear  and  unambiguous.  The  22  city  case  points  to  this 
conclusion.  On  the  otli.r  bend,  the  dir.r’tov  net!  od  tends  to  break  dent  when 
there  is  no  apparent  structure  ns  in  the  CO  city  plan  *odc.  One  distinct 
advantage  of  the  HCS  technique  is  that  both  criteria  group  cities  in  the  highly 
dense  nidwestern  section  of  the  60  city  plasnodc.  The  vcaearchcr  who  finds  he 
has  «  large  group  of  entitles  all  clustering  at  one  point  in  the  space  defined 
by  his  data  matrix  might  vish  to  loot;  et  both  connectedness  and  dimeter  Methods 
and  to  choose  the  more  intcrpretoblc  of  the  two  tcchriqucs,  if  he  wishes  to 
make  distinctions  within  his  highly  clustered  group. 

To  reiterate,  chc  direct  factor  analysis  techniques  secti  more  appealing 
than  PCS  techniques  in  describing  the  structure  of  the  space  def  l  tied  by  the 
plas.’ocijs.  The*  choice  between  oblique  or  orthogonal  solutions  depends  upon  the 
complexity  in  the  data.  1ICS  is  a  useful  technique  when  the  researcher  wishes 
to  break  up  a  dense  cluster  of  entities  with  the  dimeter  net  hod  being  slightly 
more  aoplicable  in  this  case. 
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3.  StifiST/.  »l  I Vl.  L'l.V.PU 

Am  analysis  of  rviPirt  data  vill  tc  presented  to  extend  the  finding 
01  tie  |  l«r  ode  cxperji  (•  tf.  to  inter national  relations.  Data  have  been 
collected  on  %stHe  conflict  behavior  from  the  Kew  York  Tim* :  for  1963.  Tkm 
reuniting  i  ntrix  was  26  v,  rJahl«t»  for  dyadic  conflict  over  275  dy««dc.  This 
nniri**  mu  fttCMritl  (principal  components)  end  rotated  through  orthogowil  ar.f 
ol»lif,ue  solutions  (tinl  l  and  Flumel ,  1960).  Of  the  275  nations  in  the  study, 
only  01  had  n  significant  r ,  ;owul  of  conflict  on  tt  least  one  of  the  five 
dJtiu  .•  .ion:..  Sif.nJf  Icrncc  iu  defined  nr  a  factor  score  of  1.50  or  hifher.  Two 
of  the  Hull  and  Ku'racl  factors  will  be  employed  in  this  analysis:  Unoffici- 
incidence  of  violence  end  negative  communication.  Kegntivo  eccnunicat ion  1j 
indexed  by  such  variables  as  neco rat  ions,  and  protects,  while  unofficial 
vie}. -nee  ir.  related  to  attnci’s  on  cabaisies,  persons,  or  the  fin".  Only  these 
two  dimensions  were  used  in  order  to  keep  the  geometric  interpretations  elm,  lc 

Factor  scorer-  for  each  of  the*  61  nations  exhibiting  significant  ceaflict 
in  the  original  matrix  were  organized  into  n  new  matrix.  The  order  of  this 
new  matrix  was  61  by  2:  61  nation  dyads  ar-  rows  and  2  factor  columns.  These 

scores  locate  each  dyad  in  the  space  of  tko  two  conflict  dimensions. 

Once  the  dyads  are  located,  the  next  step  is  to  delineate  their  ci.-twace 
from  eh  of  the  other  dyads  in  the  conflict  space.  The  Fuclidesn  distance 
aicnsi'-.c  has  gained  a  goad  deal  of  support  ns  n  similarity  measure  (Cronhnch 
end  G-cser,  1953;  Kunnally,  1962),  It  measures  both  elevation  (profile  nvevr.se) 
ond  rertter  (profile  standard  deviation)  similarity  ns  well  as  similarity  iu 
piofile  shape.  Thus,  it  determines  precisely  the  congruence  of  spneia)  lecctiors. 
The  distance  measure  is: 


where  S^jj  •*  dyad  b's  scori  on  the  t* 

.  factor 

Sjj,  *»  Dyad  A' 5  score 

p  «  the  nunhor  of  factor  dofinii’p, 
the  spate 

In  this  core  only  too  dimensions  arc  employed.  Tin*  tcchniqu  Jr.  gnimraliaable 
to  or;  nnny  dimensions  os  ore  required  in  define  the  space,  however. 

Once  a  wstrir  of  distances  liar,  been  developed,  the  clustering  of  dyad**  ctm 
be  dctcrr.iincd  in  the  some  wanner  as  has  been  done  in  the*  pins  odes,  rigut-  ’II 
-hows  the  steps  undergone  in  the*  nnalyris.  Ihe  geometric  interpretations  of 
the  orthogonal  solution  are  presented  in  Figure  XIV. 

Tim  orthogonal  factors  define  three  clusters.  The  major  factor  is  ci-.ure<* 
nrour.d  the  axis.  Much  of  the  group  is  located  in  the  negative  quadrant  o,  Toth 
negative  communications  and  unofficial  incidences  of  violence,  however.  This 
counter  intuitive  observation  dees  not  mean  that  the  dyads  display  a  negative* 
amount  of  behavior  used  to  name  these  factors.  It  signifies  that  while  the;  have 
no  occurrence  of  these  forms  of  conflict,  they  do  exhibit  conflict  h.dt.v * ■'**  n 
other  variables  which  have  some  slight  negative  lording  on  the  two  dimensions 
dl  scussed  here.  The  other  two  factors  define  groups  of  dyads  which  exhibi* 
high  summits  of  either  negative  coriKomicat  Jon  or  unofficial  acts  of  violence, 
but  not  both. 

The  information  presented  in  the  factor  tables  is  also  convoyed  using  n 
geometric  interpretation.  The  coordinates  in  this  presentation  are  the  dine. nsic-t 
of  dyadic  conflict  behavior  delineated  in  the  earlier  study  1  and  Runnel, 
1968).  The  dyads  have  been  plotted  in  this  spr.ee  employing  their  factor  scorer. 
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The  concentric  drcJfs  or  contour:  represent  the  factor 
loading  critoiii:  employed  pn.viously  of  .86,  .73,  .50,  ,30,  respectively. 

Turning  to  the  oblique  structure,  v:e  rve  able  to  interpret  the  primary 
pattern  matrix.  T!ie  clustering  of  t Iio  tuo  groups  representing  cxlruae 
behavior  on  either  of  the  diNcnsions  Is  r.orc  specific  to  the  r>: Irene  ends  of  t!ie 
conflict  factors.  This  1c  the  case  in  llie  or thegon;  I  presentation.  The  group 
clustering  around  the  axis  has  enlarged  rligl  tly.  There  see  r.  to  be  a  tendency 
toward  one  factor  when  a  large  percentage  of  t ire  points  are  grouped  togethci  . 

In  contrast  to  the  60  city  pins node  there  are  tuo  definite  cluster:;  which 
remain  equally  important  in  the  oblique  rotation  as  they  v.vre  originally  in  the 
orthogonal  case. 

The  angle  between  the  two  conflict  behavior  dimensions  is  90  de;;i-ar*  ^n 
the  fconetric  presentation.  This  reflects  the  fact  that  the  dimensions  were 
'.ic tu ally  independent  (Hall  and  Rutaiiel ,  1968). 

A  factor  correlation  matrix  is  computed  in  oblique  rotation  that 
delineates  the  amount  of  correlation  between  the  factors,  hall  .and  Russo]  found 
s  0,00  correlation  between  negative  communication  and  unofficial  Incidence  of 
violence.  The  same  procedure  was  employed  to  ascertain  the  relationship? 
between  the.  group  factors  in  this  analysis.  If  the  factors  are  highly  inter- 
cor related,  each  group  is  overlapping  the  other  groups  and  the  factors  are  not 
equally  defining  a  cluster  of  dyads* 

Correlations  Dctworn  Primary  l-’oc tors: 

Factor  123 

1  _ 

2  .475  _ 

.622  . 
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j'a!>) o  7.  Oblique*  Flotation  of  Conflict  Dyads 
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As  the  t’tor  trie  interpretation*;  would  lead  os  to  expect,  the  two 
extreme  factors  are  move  closely  related  to  the  factor  centered  around  the 
axis  than  they  arc:  to  each  other.  Just  a  little  over  6  per  cent,  of  the 
variance  of  the.  two  extreme  factors  are  in  common  while  better  than  16  per  cent 
of  the.  variance  in  both  cases  is  in  common  with  the  factor  centered  around  the 
axis . 

Another  test  of  the  usefulness  of  this  technique  is  to  designate  the 
dyads  which  exhibited  high  amounts  of  behavior  on  either  of  the  conflict 
dimensions.  Dyads  displaying  a  high  amount  of  negative  behavior  were:  Cuba 
to  United  States,  Soviet  Union  to  United  States;  China  to  United  States;  China 
to  Soviet  Union;  and  Indonesia  to  China.  The  high  incidence  of  negative  com¬ 
munications  in  these  directed  dyads  is  certainly  plausible.  Turning  to  high 
levels  of  unofficial  violence,  these  dyads  are:  Venezuela  to  United  States; 
Indonesia  to  United  Kingdom;  Columbia  to  United  States  and  Venezuela  to  United 
Kingd  mi.  Venezuelan  rebels.  (FLAN)  staged  several  anti-American  and  British 
attacks  in  early  1963.  Indonesians  also  demonstrated  against  the  British 
sponsored  federation  of  Malaysia. 

Ti  ■  hierarchic.!  clustering  technique  was  also  applied  to  the  distance 
matrix  d.:  ived  prior  to  factor  analysis.  Both  the  dendrograms  and  geometric 
interpretation  will  be  presented  here.  Tnc  connectedness  technique  is  presented 
first.  The  countom:  lines  for  N  levels  are  derived  by  obtaining  the  mean  si?:, 
of  ea^h  of  the  factor  contours  (at  loading  levels  of  .86,  .71,  .30,  .30)* 

Quite  few  of  the  dyads,  especially  those  with  high  scores  arc  not  grouped  using 
the  c  ''ueoj'.cdjie.ss  technique.  The  groups  themselves  tend  to  be  intuitively 
nccoj-  tab! u .  There  are  exc  eptions,  however.  Dyads  A 2  and  39  (USA-Cuba ; 

Pak-lu.d)  do  not  see  to  he  separate  spec rally  from  the  other  large  group  near 


them. 
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The  diameter  Method  clusters  almost  all  dyads  before  the  last  level. 


The  groups  seem  more  in  ter  pro  table  and  would  suggest  that  in  this  case  the 
diameter  method  is  more  applicable  than  the  connectedness  method .  The  clusters 
seem  to  define  groups  in  terras  of  the  incidence  of  either  negative  behavior  cr 
unofficial  violence  that  the  dyads  exhibited. 

This  case  seems  to  be  one  in  vfnich  there  is  a  clearly  defined  structure 
to  the  matrix.  Therefore,  the  groups  are  quite  distinct.  There  is  one  general 
group  with  a  series  of  smaller  groups  on  both  of  the  original  conflict  dimensions. 
We  have  a  combination  of  the  characteristics  in  the  two  plasmodes.  There  are. 
clearly  recognizable  groups  in  the  conflict  case  as  in  the  22  city  plasmode. 

On  the.  other  hand,  the  conflict  case  displays  a  high  density  area  similar  to  the 
mid-western  group  of  cities  in  the  60  city  plasmode.  Clearly  both  orthogonal, 
and  oblique  factors  are  interpretable  in  this  case.  The  choice,  depends  upon 
the  substantive  interests  of  tlie  analyst.  Factor  analysis  has  grouped  the  dyac’.s 
with  little  activity  on  these  two  dimensions  into  a  large  cluster.  Hierarchical 
clustering  makes  finer  distinction  within  this  group. 

In  HCS  analysis  it  v?ould  appear  that  the  diameter  method  has  produced 
clear  and  unambiguous  cluster.  The  connectedness  method  has  left  many  dyads 
unaccounted  for  and  has  produced  some  counter  intuitive  groupings. 

These  findings  are  consistent  with  those  in  the.  plasmode  experiments. 

The  researcher  is  advised  to  investigate  tlie  usefulness  of  both  direct  facte ••• 
analysis  and  hierarchical  clustering  techniques  to  his  analysis  before  using 
only  one  or  the  other.  The  ultimate  test  of  the  usefulness  of  either  technique 
rests  in  the  ability  to  organize  or  explain  new  data  independently  collected. 
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Footnotes 


1.  See  Geoffrey  II.  Rail,  "Data  Analysis  in  the  .Social  Sciences:  ’..'hat  about 
the  Details?  in  Proceedings  -  Fall  Joint  Computer  Conference,  (3903) . 

2.  For  a  description  of  this  nppronc!  to  group? n» ,  see  Pummel,  Applied  Fr_c top 

Analysis.  Chapter  22  (1909). 

3.  For  a  comparison  of  orthogonal  and  oblique  rotation  arid  a  discussion  of  the 
inter pro lability  of  each,  see  Pummel,  1969,  Ch  nter  16  and  Harman,  II. 
(1967). 

6.  For  details  of  matrix  manipulation,  see  Davies  (1963),  or  Horst,  1963. 

5,  Biquartimin  solutions  ware  calculated. 


